ON THE RANGE, LOCAL TIMES AND PERIODICITY OF 
RANDOM WALK ON AN INTERVAL 



SIVA ATHREYA, SUNDER SETHURAMAN, AND BALINT TOTH 

Abstract. The range, local times, and periodicity of symmetric, weakly asym- 
metric and asymmetric random walks at the time of exit from a strip with N 
locations are considered. Several results on asymptotic distributions are ob- 
tained. 



1. Introduction 

In this article, we study the range, local times, and periodicity or "parity" sta- 
tistics of nearest-neighbor symmetric, weakly asymmetric, and asymmetric random 
walks up to the time of exit from an interval of ./V sites. We derive several associ- 
ated scaling limits which appear curious, some which connect with the entropy of an 
exit distribution, generalized Ray-Knight constructions, and Bessel and Ornstein- 
Uhlenbeck square processes, among other objects. 

The study of the range of random walk is of course an old subject. However, 
examining the range and related statistics at the time the random walk leaves an 
interval, although a simple, natural concern, seems unexplored. We refer to Bass- 
Chen- Rosen p][Ch. 2] and Den Hollander- Weiss [3] for exhaustive references on 
the range and related statistics of random walk in various settings. 

From another view, indeed our initial motivation for this problem, the study of 
the range and other structures of random walk when it exits an interval can be 
thought of as a stochastic version of the "locker" problem, popular in university 
curriculum: Suppose there is a hallway of lockers labeled from 1 to N, for N > 1, 
which arc initially closed. Let persons L, for L > 1, walk through the hallway, 
toggling every Lth locker, that is opening it if closed and closing it if open. The 
question is then to find out those lockers which will be open after the first N people 
walk through. The lockers whose labels are the squares, 1, 4, 9, etc., are exactly 
those with an odd number of factors. Consequently, these lockers are the open ones. 
Other variations of this problem can be found in Tanton [7] and references therein. 

In our random walk setting, we can imagine each site in the interval to be either 
open or closed, and the random walker toggling a site on each visit (from open to 
closed and vice versa) before it exits the interval. In comparison to the "locker" 
problem, we address the following questions:- 

(1) What fraction of sites will be visited when the walker exits, e.g. the range? 

(2) How many times will each site be visited before exit, e.g local times across 
sites? 
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(3) And, given a set of sites that have been visited, what is the joint distribution 
of their open status at the time of exit, e.g parity of the visits to points in 
the interval? 

The specific answers naturally depend on the type of random walk considered. 
A goal of the paper is to see how the behaviors under symmetric and asymmetric 
walks are interpolated in terms of weakly asymmetric walks. 

For the first question, we derive the limiting distribution for the range (Propo- 
sition [OJ , and observe as a consequence, which seems surprising, that the scaled 
range, when starting at random, is uniformly distributed on [0, 1] no matter the 
dynamics (Proposition 12 . 3| . Also, curious values for the expected scaled range un- 
der symmetric walks, and the chance a given point is in the range, when starting 
at random are found (Remarks 12.21 and I2.4|) . 

For the second question, we find the scaling limit of the local times through a 
"Ray-Knight" construction involving Bessel and Orstein-Uhlenbeck squared pro- 
cesses (Propositions 13.21 13.41 and I3.5[) . 

For the third question, we show that the parities of well-separated points, given 
that they are visited, are independent and identically distributed Bernoulli vari- 
ables, and fair in the symmetric/weakly asymmetric case, and biased in the asym- 
metric situation (Proposition 14. II and 14. 2[) . 

Set up:- Let Tn = {0,1,2,..., N}. Let X n be the position of a random walk 
on Tn at times n > 1. At each time step, the walk moves to the nearest point to 
its left (right) with probability qN (pn) where pn + Qn = 1- The walk stops the 
moment it is at either or ]V. When pn = qN = 1/2, the walk is of course referred 
to as the symmetric random walk. When qN = 1/2 — c/N (and so pn = 1/2 + c/N) 
for some constant c > and N large enough so that < pn , qN < 1, we say the 
walk is weakly asymmetric. When qN = q < p = pn, the walk is asymmetric. 

Define T a = inf{n > 1 : X n = a} as the hitting time of a £ Tn- Then, 
tn = TqATn is the "exit" time from the strip. Clearly, starting from 1 < x < N—l, 
tn is finite: P x (tn < oo) = 1 where we denote P X (A) = P(A\Xq = x) as the 
conditional probability of the event A with respect to the walk starting from Xq = x. 

Then, the number of visits to y £ Tn before exiting is G(y) = X)fe=o ^y(Xk)- 
Hence, the event y is visited at all corresponds to G(y) > 1. In this case, we say 
the parity of y is "even" (locker y is closed) if G(y) > 1 and G(y) = mod2. 
Correspondingly, the parity of y is "odd" (locker y is open) when G(y) > 1 and 
G(y) = 1 mod 2 . 

The plan of the article is to address questions (1),(2) and (3) in sections [21 El 
and [?] respectively. 

2. Question 1: Range of random walk in Tn 

In this section, we obtain distributional limits of the range up to the exit time 
when starting from a point, and at random in subsections 12. 11 12.21 

2.1. The range starting from a point. Denote Rn as the number of locations 
visited before exit, the range of the walk on Tn, that is 

Rn = #{y e Tn : G{y) > 1}. 

Observe, when starting from [aN], necessarily [aN A (1 — a)N] < Rn < N. 
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Proposition 2.1. Let Xq = [aN] for < a < 1. For symmetric and weakly asym- 
metric walks, Rn/N converges in distribution to absolutely continuous measures on 
[0, 1], respectively Go. a md G c . a defined in \2.2] ) and \2.4]) - For asymmetric walks, 
Rn — [(1 — a)N] =>■ Z where Z is Geometric(q / p) . 

Proof. First, we observe, for < f3 < 1, starting from location x = [aN], since the 
motion is nearest-neighbor, 

{R N >8N} = {Rn> (3N,t n =T }U{R N > f3N,T N =T N } 

= {T[tSN] < T < T N } U {Tjv-^jv] <T N < T }. 

We now specialize arguments to the three types of random walks. 

Symmetric Walk:- When the walk is symmetric pn = <1n = 1/2, recall the 
standard Gambler's ruin identity: For a,b, z € Tzv, such that a < z < b, 

P z {T a <T b ) = b —^. (2.1) 
o — a 

For 8 > a, compute 

[aN] N - \fj N] a(l - (3) 



P[*n]{Rn > [I3N],t n =T q ) 
l, f3 > 1 — a, we have 
P [aN ](RN>[PN],TN = T N ) = 



[f3N] N (3 

When, f3 > 1 — a, we have 

N — [aN] N — [f3N] (l-a)(l-/3) 



hm P [aN] (R N /N>j3) = s 
N^oo when aV(l-a)<p<l 



[/37V] N f3 

Putting these expressions together, along with simple calculations, we have 

1 when 0</3<aA(l-a) 
aA( ^ a) when a A (1- a) < (3 <aV (1-a) 

P 

when (3 > 1. 

The right-side defines a distribution Go_ a , supported on [aA(l-a), 1] whose density 

[ " A( /3 1 2 " a) for a A (1 - a) < < a V (1 - a) 
SaOS) = < ^ for a V (1 - a) < f3 < 1 (2.2) 

otherwise. 

Weakly- asymmetric walk:- In the weakly asymmetric case, = 1/2 — c/iV and 
p N = 1/2 + c/7V with c> 0, let 

Pat 1/2 + c/N N y ' 

The corresponding gambler's ruin identity becomes, for a < z < b, 

(qN/PN) Z - (qN/PN) b 

(qn/pn) 11 - (qn /pn) 

Then, following the symmetric argument, when (3 > a 



p.p* < ^ = 7.L L~,rL i b - (2-3) 



1 _ [aJV] [/3JV] _ oN 



P [aN] (R N >WN],T = T N ) = 

1 - s L £ 1 



AT 3 AT S N 

AT 



I _ e -4ac g-4^c _ e -4c 

; := A-\(a,B,c). 
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When (} > 1 - a, 
P [aN \{RN > [/3N],Tn=t n ) 

Noting 



iJV] 



N-[PN] 



3 iV 



-4ac 



iV 

-4c 



-4(l-,8)c 



-4(l-/3)c 



1 - e 



lim P[ a m(T N <T ) = 
wc have 



lim P [aN] (R N /N>(3) 

Nfoo 



1 - e" 



-4c 



and Ai + ^4 2 = 



-4c 



4ctc^g4c(l 



A 2 (a,/3, c). 



4c0 



1 

1-e- 4 "* 

l-e 4 "' 1 — ' 
-4 Q c (e 4c(l-g)_ 1) 



when 0</3<aA(l-a) 
when a < /3 < 1 — a 
when 1 — a < [3 < a 
when a V (1 - a) < (3 < 1 



when /3 > 1. 

(2.4) 

The right-side defines a distribution G c>Ci , supported on [a A (1 — a),l], whose 
density, although messy, can be easily derived. 

Asymmetric walk:- In the asymmetric case, qjy = <Z, = P, and p > q, and it 
is not difficult to see that we cannot go left too many times. The gambler's ruin 
identity (|2~I5|) also holds in this case and, for x = [aN], P X (T < T N ) = exp{-CN} 
for some constant C > 0. 

To complete the proof, for integers z > 0, compute 



Px(Rn >N-x 



= P X (T N < T Q ,R N > N-x + z) 
= P X (T X - Z < T N < T ) + o(l) 
= (q/ P y + o(l). 



o(l) 



Remark 2.2. 1. As expected, G c . a interpolates between the symmetric and asym- 
metric cases: Namely, as c J, 0, G c>a =>• Go, a, an d as c f oo, G CiCC converges to the 
constant 1 — a. 

2. It is curious to observe, for symmetric walks, that starting from x = [aiV], 
with a G (0, 1/2], the expected range 



l— a 



a 



1 



P^d(3+ (3—d(3 



-(1 — a) log(l — a) — a log(a) 



l-a 



is the entropy of the exit distribution (1 — a, a) where 1 — a is the probability of 
leaving by the left endpoint, and a the chance of exiting right! The maximum value 
log 2 occurs when a = 1/2. 

3. For symmetric and weakly asymmetric walks, the limit distributions may also 
be derived in terms of Brownian motion and diffusion estimates. 

2.2. The range when starting at random. We derive now the limiting law of 
Rn/N when starting at random, that is the uniform distribution on Tn- It seems 
nonintuitive that the limit law is U[0, 1] no matter the type of random walk. 
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Proposition 2.3. For symmetric, weakly asymmetric and asymmetric random 
walk, when starting at random in Tn, Rn/N converges weakly to the uniform dis- 
tribution U{0, 1] . 

Proof. Suppose our starting point was random. In the symmetric and weakly asym- 
metric cases, the limiting distribution of Rn/N, from straightforward considera- 
tions, is found by integrating the density g a and tail of G CiQ with respect to a 
(denoted by G c , a ([/3, 1])). 

In the symmetric case, when j3 < 1/2, 

ot , / „, / 1 — a 



g a (f3)da = / ~^da + / da + / da 

P JB Jl-B P 



But, also, when j3 > 1/2, 



1-/3 ,-1-B -, r i 1 

a , / 1, / 1 — a 



On the other hand, in the weakly asymmetric case, we have, when /3 < 1/2, 
G c ,a{[PA])da = J ^ - e _ 4e) g da + y Ida 

1 e -4c,3( e 4c(l-a) _ ^ 
^ e -4c(l-« _ e -4c = 1 - f>- 

Similarly, when f3 > 1/2, f Q G c . Q ([/3, l])da equals 
1 



1 - e" 4 ^ 



1-/3 rP 

l-e~ 4ca da+ / e -4ca( e 4 C (i-/9) _ ^ da 

Jl-B 



e 



-4c^( e 4c(l-a) _ ^ 



= 1-/3. 



Consequently, for symmetric and weakly asymmetric walks, the limiting distribu- 
tion is U[0, 1] when the starting position is uniformly chosen. 

However, in the asymmetric case, from Proposition 12. ll Rn/N — > 1 — a in 
probability starting from x = [aTV]. Then, starting at random in T/v, we have that 
Rn/N — ► Y in probability where Y is a C/[0, 1] distributed random variable. V 

Remark 2.4. One might ask, on the other hand, with what probability a point 
y = [f3N] belongs to the range when starting at random. This is the same as asking 
when y is visited by the walk. For symmetric walk, it is not difficult to use the 
gambler's ruin identity (|2.1[) to see, as N f oo, that the probability tends to 

1 1 - a , f p a , 1 
-da + / — da = — . 

p 1-/3 io £ 2 

It seems curious that the limit does not depend on (3. 

For asymmetric walk, starting from [oJV], when a > j3, the point y cannot be 
reached with positive probability in the limit. Then, the chance y belongs to the 
range, when starting at random, is /?. 

For weakly asymmetric walks, using (|2.3p . the limit is 1 _f-4 C $ — 1 _ e 1 4 7(f-g) which 
interpolates between the other cases as c 4- and c f oo 
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3. Question 2: Characterization of local times 

To capture the local times of the random walk before its exit, we use the "Ray- 
Knight" or "Kesten-Kozlov-Spitzer" representation, and some martingale charac- 
terizations. Our treatment and proofs will be similar to those in Toth [8] which 
considered certain self-interacting random walks. 

Let < a < 1. Suppose the walk starts at [a AT], and exits at the right endpoint 
N. Let be the number of left crossings of the bond (N — (j — 1), N — j) before 
exit. Then, = 0, and is distributed as Dn, a Geometric ( qpf) random variable 
minus 1, P{D N = ri) = Pn<1n for n> 0. In the following, we drop the script N. 

Let {£j,i}i,j>o be i.i.d. random variables with distribution D^. A moment's 
thought convinces that {Cj}o<j<N is a Markov chain with representation 

torO<j<[(l-a)N] (31) 
' ' for [(1 - a)N] <j<N-l y ' 

such that 

Q = for some [(1 - a)N] < j < N, (3.2) 

with the convention that empty sums vanish. 

Note that for j < [(1 — a) AT], the sum starts with index i = since, even if 
(j = 0, given exit at the right, the walk must visit locations [a AT] < x < N and 
may have left crossings of (x — l,x). However, for j > [(1 — a) AT], since the walk 
is not guaranteed to visit sites to the left of [a AT], Q is the size of a Branching 
process, with initial value Cf(i-a)iVI) which must vanish before time j = N. 

Then, the local time of the walk is 




G(y) 



Cn- v for < y < [aN] 
( N - y + 1 for [aN] <y < N. 



In the following, to analyze {Cj}o<j<N, it will be helpful to consider the Markov 
chain r)j, such that 770 = and 771 = Djv, for which representation fl3.1[) holds in 
terms of the variables {£j,i}i,j>o, but without the restriction p.2[) . 

When the walk exits at the left endpoint 0, one considers an analogous Markov 
chain Q, corresponding to right-crossings of (J,j + 1), where the representation 
and restriction are reversed. Namely, let Dn be a Geometric (pn) random variable 
minus 1, P(£>n = n) = qNP% for n > 0. Define ( = 0, £1 = Djf, and 




for < j < [aN] 

for [aN] < j < N — 1 



such that Q = for some [aN] < j < N. The local time of the walk in this case is 
G(y) = C y for [aN] < y < N and G{y) = ( y + 1 for < y < [aN]. Here also it will 

be of use to define analogously a Markov chain fjj satisfying fjo = 0, fji = Dn, and 
the reversed representation but without the restriction that the chain must vanish 
for [aN] <j<N. 

Finally, define Y/v(t) = jfV[Nt] an< A Y^(t) — jrfj[m] for < f < 1, and suppose 
that Fjv(0) = Fjv(0) = 0. 
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3.1. Symmetric walks. Consider the following processes. Let Zq — 0, and define 




£ + Jl y/2Z~ s dB s for < £ < 1 - a 
Zi-a + fi_ a V2ZldB a for 1 - a < £ < 1. 



Observe that Zt for < £ < 1 — a is the same in law as Besq 2 (£/2) process, and a 
Besq°(£/2) process for 1 — a < t < 1 (cf. Revuz-Yor [B] for more on the processes 
Besq 5 dX t = 5dt + 2^fX~ldB t ) . Let Tq be the first time Z t hits after time £ = 1 — a. 
Note that Z t remains at value after time Tq*. 
Define also Z t where Zq = and 

^ ( £ + f Q \fW s dB s for < £ < a 
\ Z a + /* \plZ s dB s for a < i < 1. 

Let also be the time Z t reaches after time £ = a. Here, also, Z t = for £ > . 

It will turn out that Z t and Z t will be identified respectively, as the scaling limits 
of the local times when the random walk exits at the right and left endpoints of 
the interval. The important point in this identification is the next result. 

Proposition 3.1. For symmetric walk starting from x = [<xN\, we have 

Y N (t) => Z(t) and Y N (t) => Z(t) 

in D[0, 1], in the sup topology. 

Instead of proving Proposition 13. 1[ which follows steps as in Toth [8] , we prove 
Proposition 13 . 31 in the next subsection, with respect to weakly asymmetric random 
walks, dealing with squared Ornstein-Uhlenbeck processes which are less standard. 

Now, with Proposition 13 . 1 1 in hand, since ljv(i) and ^at(£) converge respectively 
to Zt and Z t in the sup topology, it follows that the conditional distributions of 
Yat(£) given rjj vanishes for j > [(1 — a)N] and Y]y(t) given fjj vanishes for j > [aN] 
converge to the conditional distributions of Z t given that 1 — a < Tq < 1 and Z t 
given that a < Tq < 1. 

Hence, from this discussion, the following characterization holds for the local 
times of the walk up to time of exit. Recall that 1 — a and a arc the exit probabilities 
of right and left exit respectively. 

Proposition 3.2. For symmetric walk starting from [aN], the local times 

G([Nt])/N => a/j H + (l-a)n L 

where [i R is the law of the process Z\_t conditioned on 1 — a < Tq < 1, and /i L is 
the law of the process Z t conditioned on a < Tq < 1 . 

3.2. Weakly asymmetric walks. The development of the local time structure 
is similar to the symmetric case. Corresponding to right exit, EDn = Qn/pn = 
1 ^nT2^. and Var(Djv) = q 2 N /p 2 N + q N /p N = 2 - + ^+% c )^ ■ Define the process 
Z$ by Z% = 0, and 




/ *(1 - AcZ^)ds + $1 ^/TZ~ldB s for < £ < 1 - a 
Z c i- a - Sl- a * cZ s ds + Sl-a V^dB s for 1 - a < t < 1 
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Note 2(Z t c + 1) and 2(Z f c — t) are the squares of the Ornstein-Uhlenbeck process 
dX t = -AcX t dt + ^/2dB t for < t < 1 - a and 1 - a < t < 1 respectively. 

Also, with respect to left exit, ED N = pn/(1n = 1 + 4c/iV + 0(N~ 2 ) and 
Var(^jv) = 2 + OiN" 1 ). Define Z c t by Z§ = and 



2 t c 



/ (1 + 4cZ s c )ds + J J2Z%dB s for < i < a 
Z c a + J2Z~^dB s for a < i < 1. 



As before, let Tq~ be the first time after t = 1 — a that Z£ reaches 0, and Tq be 
the first time after t — a that hits 0. 

Analogous to the symmetric random walk case, we show that Z£ and Z£ are the 
scaling limits of the local times when the weakly asymmetric random walk exits at 
the right and left endpoints respectively. 

Proposition 3.3. For the weakly asymmetric random walk starting from x = [aN], 
we have 

Y N (t) =}► Z c {t) and Y N {t) ^ Z c {t) 

in D[0, 1] ; in the sup topology. 

The same argument as in the symmetric case allows to deduce the the following 
characterization. 

Proposition 3.4. For the weakly asymmetric walk, starting from x = [aN], the 
local times satisfy 

G([Nt])/N R(a)^ + (l-R(aM 

where ^ is the law of the process Zf_ t conditioned on 1 — a < Tq < 1, and (j£ is 
the law of the process Z£ conditioned ona < Tq < 1. Here, R(a) = (1 — e~ 4ca )/(l — 
e _4c ) is the exit probability to the right. 

Proof of Provosition WT^ Here, we argue that ljv(i) => Z c (t) which corresponds 
to "left crossings." The argument for Ypj(t) Z c (t) is similar. 

The proof naturally separates into two parts corresponding to when j < [(1 — 
a)N] and j > [(1 — a)N]. The strategy will be to use martingale decompositions 
of the Markov chain {rjj}j>o- Define, for [Nt] < [(1 — a)N], the martingale and its 
quadratic variation, 

[Nt]-1 

M N (t) = r) [Nt ]-r)o- ( E [Vj+i\Vj] - Vj) 

3=0 

[Nt]-1 

(M N (t)) = J2 E[{r l]+1 -E[i lj+1 \i 1] ]) 2 ]. 

3=0 
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Then, 
1 



[Nt]-1 



N 



M N (t) 



Y N (t)-Y N (0)-± {E{D N )( V] + l)- m ) 



3=0 
[Nt]-1 



Y N (t)-Y N (0)-± J2 ((!- 

3=0 

Y N (t)-Y N (0)-±[Nt] + 



4c 



N + 2c 

[Nt]-1 



and 



[Nt]-1 



N + 2c 



E 



4c[ATi] 



AT/ iV(7V + 2c) 



^ £;[( %+1 - J B( J D JV )(% + i)) 2 te] 



iV 2 



i=o 
[Nt]-i 



J2 E * 



iV 2 



3=0 
[Nt]-1 



J2^3,i-E(D N ))\ \ m 



\i=0 



^2 E ^ + !) Var (^) 



3=0 
[JVt]-l 



(3.3) 



Now suppose yiv(t) and iV 1 M^(t) are tight in the sup topology, and Y^(t) => 
Z t , iV _1 Mjv(i) => M(t) on subsequences. Then, M(t) = Z t -Z - j *(l - 4cZ s )ds 
and (M(t)) = 2 J Q * Z s ds. Hence, by Levy's criterion for continuous martingales, we 
have that Z t is uniquely characterized by 



Z t =Z + [ (l-4cZ s )ds+ f \/2Z~ s 
Jo Jo 



Similarly, for [Nt] > [(1 — a)N], since now r)j + i = YliLi£j,ii the drift is not 
present, and we can write 

[Nt]-1 

-{M N {t)-M N {l-a)) = Y N (t)-Y N (l-a)+ ]T (E( Vj+1 \ Vj ) - Vj ) 



j=[(l-a)N] 

[Nt]-1 



and also 



Y N (t)-Y N {l-a) + 



[Nt]-1 



4c 



E y» 



j=[N(l-a)] 



N 



l((M N (t))-{M N (l-a))) = ± + l)Var(DAr) 

3=[JV(l-a)] 
o I^*]" 1 / ■ \ / 1 

- If E 



j=[JV(l-a)] 
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Hence, as before, given tightness of TV 1 Miv(t), and sub-sequential convergences 
Y n {t) => Z(t) and N^Mn^) M(t) on [1 - a,l] where M(t) - M(l - a) = 
Z t - Zi_ Q and (M{t) - M(l - a)) = 2 /*_ q Z s ds, we conclude, for i G [1 - a, 1], 
that 

ft 



2Z edB e 



Consequently, it follows, putting the subsequential converges together, for < t < 1 
that Y]y(t) converges weakly to Z t . 

Tightness. We now argue tightness of Yjv(i) and 7V _1 Mjv(i) on [0, 1 — a]. Tight- 
ness of Yjf(t) follows from tightness of iV -1 M^(t) in the sup topology which can 
be argued by a Kolmogorov-Centsov argument. First, for a general discrete time 
martingale (M(l), J r i) with difference 8(1) = M(l) — Mil — 1), we have that 

I 

E[(M(l)-M(k)) A ] = 6 ]T E [8(j) 2 (M(j - 1) - M(k)) 2 ] 

j=fe+i 

i i 
+4 £ E[S(jf(M(j-l)-M(k))]+ £ E i 6 t] 

j=k+l j=k+l 

and by Jensen inequality, 

I 

E[(M(l)-M(k)f] < 6 Y, E[E[S(j) 2 \^ j - 1 ](M(j-l)-M(k)f] 

1/2 



j=k+l 



+ 4 ]T {^[B[<y(j) 4 |^-i] 3/2 (M(j-l)-M(fe)) 2 ]} 
i=fc+l 

+ £ E[E[8U) A \^-i]]. 

j=k+l 

Now, in our context, define the martingale, for I < [(1 — a)iV], 

i-i 



M(0 = ^-770-^(^^+1!^]-%) 



i=0 

so that Mff(t) = M([Ni\), and also the stopping time 

6 y , N = inf{Z > : 77/ > ATy}. 
Compute, with respect to M(l A v ,n), that 
= M(lA6 yiN )-M(l-lA9y tN ) 

= ViA0 N>y - E[rn A e ViN \rii- 1A yiN ] = Y, (£i/\e y , N ,i - E(D N )). 



i=0 



Hence, we have 



J5 



^ (6A^,i-^Jv)) 



i=0 



/\0„ 



< Va-r(D N )rn_ 1A e y N < ciNy 
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and 



E 



i=0 



< {c 2 ED A N + l){{Nyf + Ny). 
Also, noting the quadratic variation estimate p. 31) , 
1 



N i E 



(M(j - 1 A e N , v ) - M (k A e N . y )f < ca\j -k\[j^+y 



1 



Hence, we have, for some constant C4 not depending on N or y, that 

-L E [(M([Nt]A6 y . N )-M([Ns]A9 y , N )) i ] < c 4 maxjy 2 , l}(\t - s\ 2 V ^). 

Then, by Theorem 12.3 Billingsley [2], N~ 1 M{[Nt] A A^jy) is tight for any y < 00. 
Hence, N~ 1 AlN(t) is tight in the sup topology on [0, 1 — a]. 

Tightness with respect to the interval [1 — a, 1], and consequently the whole 
interval [0,1] follows similarly. V 

3.3. Asymmetric walks. The situation is much different for asymmetric walks, 
in particular, the local times are of order 0(1), and no scaling is required. Given 
p > q, the walk starting from x = [aN] will exit to the right with probability 
tending to 1 as N j" 00. The sequence rjj for 1 < j < [(1 — a)N] is a branching 
process with mean offspring EDn = q/p < 1 and immigration at each time of 
one individual. The initial population is rji with the distribution of D = Dn, a 
Geometric((?) random variable minus 1. Hence, this sequence is a positive recurrent 
Markov chain, and T][(i- a )N\ converges to the stationary distribution 7r. 

On the other hand, the chain r/j for j > [(1 — a)N) is the usual Branching process 
with offspring distribution Dn (and no immigration). Hence, it dies out in finite 
time. 

The stationary distribution n can be described by its probability generating 
function ^(s) = J2k>o 7T (k)s k . Let <fi(s) be the probability generating function of 
D. Then, easy computations give that ^(s) = ^(^(s))^^). 

Hence, since the distribution of ?7[(i- Q )jv] converges to 7r, we can state a limit 
characterization in terms of a reversed process. 

Proposition 3.5. Consider the asymmetric walk when p > q starting from [aN]. 
For any M > 1, the reversed process {/3& = T][{i-a)N]-k}kLo converges in distribu- 
tion to the reversed process starting from the stationary distribution tt of the chain. 

However, {/3_fc = t/[(i— Q )Af]+fc}fc=o converges to a Branching process with off- 
spring distribution D starting from tt. 



4. Question 3: Periodicity 

We now address the parity of various well-separated locations visited by the 
walk before exiting. We remark different types of multiple point structures in other 
settings have been studied in Hamana [4] and Pitt [5]. 

Let < ai < a 2 < ■ ■ ■ < a k < 1, and e H G {0, 1} for 1 < i < k. 
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Proposition 4.1. With respect to symmetric or weakly asymmetric walks, for a € 
(0, 1), we have 

Jim P [aN] (n£ =1 {G([a i iV]) = e t mod 2 }| max T< N] < t n ) = ^. 

N~>oo \ l<i<k J Z 

In other words, in the symmetric or weakly asymmetric cases, given that the 
locations arc visited, the parities at {[c^iV]}*^ converge to i.i.d. fair Bernoulli 
random variables. 

But, with respect to asymmetric walks when p > q, starting from [aiV], unless 
a < /?, [/3iV] is not visited with probability tending to 1. So, it makes sense only to 
discuss parities of sites to the right of [aN]. 

Proposition 4.2. With respect to asymmetric walks when p > q, suppose < a < 
a\ . Then, 

lim o P [aN] (nl 1 {G([a i N}) = lmod 2 }) = — J_ g))fc - 

By the inclusion-exclusion principle, one concludes, in the asymmetric situation, 
the parities at {[a:i-/V]}k =1 converge to i.i.d. Bernoulli random variables with success 
probability (2 — (p — qj)^ 1 ■ 

We remark, with respect to the 'stochastic locker' intcprctation, one concludes 
that the expected proportion of lockers left closed is half or (2 — (p — q))^ 1 times 
the proportion of the range in the symmetric/weakly asymmetric, or asymmetric 
cases respectively. 

4.1. Proofs of Propositions [4TT1 and 14.21 The proofs of the above propositions 
are similar. We first derive the chance a single site is left open, and then later use 
this development in an induction scheme. Let Ty be the rth hitting time of y, and 
T y = inf{n > 1 : X n = y} be the return time to y. The event that site y is left 
open, with various prescribed exits, is expressed as 

{G(y) = 1 mod 2 ,T N < T } 

= U fe >o{T! < t n } n {Tf' +1 < t n } n {T N < Tf + 2 A T }. 



Similarly, 



{G{y) = 1 mod 2 ,T < T N } 

= Ufe^i; 1 < t n } n {Tf +1 < t n } n {To < Tf + 2 A T N } 
{G(y) = 1 mod 2 } = Ufe^i; 1 < t n } H {Tf +1 < t n } H {t n < Tf + 2 }. 
Then, 

P x (G(y) = 1 mod 2 , T N < T ) = P x {T y < T N )P y {T N < f y ) ^ P y (f y < t n ) 



21 

2>0 

P X {Ty < T N )Py{T N < Ty) 



\-{l-P y {T N <Ty)f 
P X {Ty < TN) Py{T N < Ty) 

2-Py(T N <f y ) p y {T N <f y y 
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Also, 

P X (Ty < T N ) Py{T < Ty 



P x {G(y) = lmod 2 , T <T N ) 



2-Py(T N <Ty) Py { TN < fy) 

P x (G(y) = 1 mod 2 ) = 



P x (T y < t n ) 



2-P v (r N <Ty) 

In this last expression P x (T y < Tjy) is the probability y is visited starting from x, 
and (2 — P v (tn < Ty)) -1 is the factor specifying that y is left open. The quantity 
P v {tn < T v ) can be viewed as an "escape probability." 

Suppose now x = [aN] and y = \j3N]. In the symmetric case, we compute 

r N - x for y<x<N, 



P x (T y < t n ) = | 

and 



| for < x < y 



11 N 

Py(Ty < T N ) = -Py-^Ty < T ) + -Py +1 (Ty < T N ) = 1 - ^ _ y) ' 

In the (weakly) asymmetric case, we have 

fel for x > y 



P X {Ty < T N ) 

and 



for x < y 



Py(TN < Ty) = q N Py-l(T Q < Ty) +p N P y+1 (T N < Ty) 



p N {l - Sjv)(1 - s%) 



(l-^)(l-4" tf ) 



Then, 

Py(T N < Ty) 



for symmetric/weakly asymmetric walks 
p — q for asymmetric walks. 

Putting these observations together, we have the following result. 

Proposition 4.3. Under symmetric or weakly asymmetric motion, 

lim P x (G{y) = 1 mod 2 |T„ < T N < T ) 

Nfoo 

= lim P x {G(y) = lmod 2 \T y <T <T N ) = -, 

Nfoo 2 

and hence limjvtoo P x {G(y) = 1 mod^Tj, < tn) = 1/2. 
However, under asymmetric motion, for x < y, 

lim P x (G(y) = 1 mod 2 ) 



Ntoo 2 — (p — q) 

Proof of Proposition ]^. 1\ Let G n (y) = Y^i = q N l v (Xi) be the number of visits to 
y up to time n A t^. First, we write 

P[ QjV ](n-U{ T KiV] < r w },nt 1 G([a l 7V]) = a mod 2 ) 

= P [aN] (nti{G([aiN]) = e t mod 2 },T [aiN] < T N < T ) 

+P[ a N](ntiG([a. t N}) = e t mod 2 ,T [akN] < T < T N ). (4.1) 
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We now concentrate on the first term on the right when T/v < To, as the argument 
is similar for the second term. Since, on the set Tv < To, the walk must leave [aiN] 
never to return, and is also nearest-neighbor, write 

P[aN] 

{T [aiN] <T N < T ,nti{G([a 4 jV]) = e , mod 2 }) 

= E P i«m ( T [aU] < n i=2{G T * a \ N] ([aiN]) = *}) 

•i z k 

z\—e\ mod2 
' P[ ai N](T[a 2 N] < T[ aiN ] A T N ) 

■ P [a2 N](T N < T [aiN] ,nt 2 {G([aiN]) = e(zi)}) (4.2) 

where e(zi) = or 1 — e\ if z\ is even or odd respectively. 

In the last factor, which deals with the parities of k — 1 points, [aiiV] can be 
translated to x = 0. Treating the limit in Proposition 14.31 as a base step, we may 
conclude by induction, for fixed e(zi), that 

Jim P la2 N](nL 2 {G([ ai N}) = e( Zl )}\T N < T [aiN] ) = 2-<- k ~ 1 \ 
Hence, by bounded convergence, we may replace the last factor of (|4.2p , by 
2- (fc - 1) T[ Q2 Ar](Tv < T [aiN] ) + o(l) 
as N f oo . Summing over z% , ■ ■ ■ , Zk , we have 

P[aN] (T[ Ql jv] < T W < r 0l nJ =1 {G([oiJV]) - e,}) 

E p [«JV](T^ iAr] < rjv) 

_2i — ei mods 

• [2" (fe - X) P[a 2 AT] (Tat < T [aiN] ) +0(1)] 

= pzx- p [aiV](G r ([aiA r ]) = 61 mod2 ' T [ Q i JV ] <t n < T o) +o(l). 
Therefore, noting Proposition 14.31 

lira P[ aUV] (n^ =1 {G([o i JV]) = e i }|T [aiAn <T w <r ) = i. 

A similar expression is derived when the conditioning event is Tu^l < T < Tv, 
and so the limit in Proposition 14. II is recovered. V 
Proof of Proposition^^ The proof is easier than that for Proposition ^. II Since 
the probability of backtracking, T[ 7 jv](T[^/v] < t/v) is exponentially small in N for 
P < 7, and noting Proposition 14. 3[ we have 

k 

P [a N](nti{G(i^N}) = 1 mod 2 }) = o(l) +l[P [aiN] (G([a i N]) = 1 mod 2 ) 

»=i 

(2-(p- 9 ))-* 

V 
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